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Abstract — Information model portal of scientific knowledge 
which should provide substantial access to scientific information 
resources is proposed. Information model combines the models 
of domain and problem fields of the portal, as well as describes 
the types of the presented information. On the base of this model, 
the internal portal data storage is constructed, its information 
content, navigation and search are organized. 

Index Terms — Ontology, knowledge portal, information 
model, scientific information resources 


I. INTRODUCTION 

Currently, the problem of the effective use of the 
vast amount knowledge and information resources 
accumulated in various areas of human activity is very acute. 
However, access to these knowledge and resources is 
significantly limited due to the fact that they are poorly 
structured, poorly systematized, and dispersed across various 
Internet sites, libraries and archives. To solve this problem, an 
approach to the construction of specialized Internet 
knowledge portals [1,2] ensuring the integration of 
accumulated knowledge and information resources in a 
certain field of scientific knowledge and substantial access to 
them is proposed. 

An information model is proposed to provide a 
unified presentation of diverse knowledge of data, accounting 
for their connectivity, as well as supporting the functionality 
of such portals, the basis of which is ontology. The paper is 
devoted to the description of this model. 

n. Information model portal of the knowledge 

Information model portal of the knowledge 
combines the models of domain and problem fields, as well as 
describes the types presented in the information portal. 
Formally the information model of the portal M is 
described by two: 

M p =(o p ,IC p ), 

where O is the ontology of the portal; IC is 
information content of the portal. 

Ontology is the core, basic component of 
information model of the portal. Not only it describes the 
knowledge system of the portal, but also gives a formal 
structure for the presentation of its content. 

In the paper ontology conception is used in the sense 
that how it is applied to the computer science and artificial 
intelligence. In particular, we adhere to the determination of 
the ontology given in the paper according to which the 
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ontology is an exact specification of conceptualization. Here, 
some abstraction is realized by the conceptualization, i.e. a 
simplified view of the world constructed for a specific 
purpose. Conceptualization includes the objects, conception 
and other entities of the studied fields, as well as the relation 
between them. 

For the presentation of ontology, the formalism 
ensuring the description of the problem and domain fields 
conception of the portal and diverse semantical relations 
between them is necessary. An important demand for it is the 
ability to construct the concepts of the domain field (DF) in 
the hierarchy of "general-private" and support for the 
inheritance of properties in this hierarchy. This formalism 
should also provide the ability to set constraints on the 
properties of domain fields and the organization of logical 
inference. 

The formalism meeting the above mentioned needs, 
which is presented as a met ontology form, is proposed in the 
paper. 

O =< C, R, T, DA, F, Ax) (1) 

where 

C = {C 1 ,...,C n } is the final non-empty set of 
classes describing the conception of some domain and 
problem fields: 

/? = {/?!„.. ,i?J, R ( CCxC,R = R T \jR p \jR A is 

the final set of binary relations given in the classes (concepts); 
here, R r is antisymmetric, transitive, unreflective binary 
relations of the inheritance giving partial order in the set of 
concepts C ; R p is binary transitive relation of the inclusion 
(partial- whole); R A is the final set of associative relations; 
T is a set of standard types; D = \d v . . ,,d n \ is a set of 
domains; d l = , . . ., s k ) is the value of standard types s t ; 

TD = T UZ)is a generalized data type including a 
set of standard types and a set of domains; 

A = {a v ...,a w ) , A^CxTDUR a xTD is 
the final set of attributes, i.e. binary ratio of the form 
a ; . (Cj,td k ) or (Rj , td k jdescribing the properties of the 
concepts C and ratio R a ; 

F is a set of restrictions on the value of concepts 
and ratio attributes, i.e. predicates of the 
form p i {e l ,...,e m ), where e k is the name of either attribute 
{e k £/l) or constant [e k ^fdj, where td j G ZD); 

A x is a set of axiom determining semantic groups 
and ontology ratio. 

The ratio R t serves to define the hierarchies on 
concepts. Its peculiarity is that while inheriting from a parent 
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class, its class-descendant is passed not only to all the 
attribute, but also their relations. 

The inclusion relation R p is endowed with the 


transitivity property, due to while searching for the objects 
(direct instance of classes), it is possible to carry out a 
transitive closure in this relation. 

A set of associative relations R A is determined by 
need for the knowledge about the links between the objects in 
specific problem and subject areas. The proposed formalism 
enables to specify such relations attributes specializing the 
relationship between the arguments (objects). 

Information content portal 1C is constructed on 


the base of formal structures specified in meta-ontology O 
(formula (1)): 

1C p = ^I,R n V ,A i ,R ic ,R ir ,R ia ,R vtd ' s ) , 
( 2 ) 

where 

/ = {z, , . . .,Z n }is the final set of direct instance 
classes of ontology; 

Rj = {ri [ ,...,ri k ] is the final set of specified 
relations (instances of relations) ri i {jj,i k ) between 
instances of classes. 

V = {vp. . .,V q }is the final set of specific value of 


the generalized type TD; 

Aj = {ai \ . .,Cli w } is the final set of specified 
attributes, i.e. binary relations k x > v y ) or a th^Th> v y) 
between instances of classes or the ration of specific values; 

R IC C/ xC is a binary incidence relation between 


a set of instances 1 and a set of classes C ; 

R m is a binary incidence relation 

between a set of instances R r and a set of relations R ; 


R ia ^A 1 xA is a binary incidence relation 
between a set of specified attributes A t and a set A ; 

R wd <= VxTD is a binary incidence relation 
between a set of values V and generalized type TD. 


III. Knowledge portal ontology 


From the substantive point of view, the ontology of 
the portal knowledge serves to present the concepts needed 
for describing both scientific activity and scientific 
knowledge on the whole, and particularly specific knowledge 
area. 

In order to simplify the configuration of the portal to 
the selected area of the knowledge, the ontology of the portal 
is divided into subject-independent (basic) ontologies and the 
knowledge portal domain ontology (subject ontology). 

The ontologies of scientific activity O RA and 
scientific knowledge O sk , which doesn’t depend on the 
domain of knowledge portal, are chosen as a base. These 
ontologies are constructed on the base of the meta-ontology 
formalism O described in section 1 . 


The portal knowledge domain can include several 
related subject areas. In this connection, the ontology of the 
knowledge portal domain 0 KD can include the ontologies of 
several subject domains O soi constructed on the base of 
scientific knowledge ontology O sk . 

Thus, the knowledge portal ontology O p has the 
following form: 

Op = { Ora ’ O kd ) , 

where 0 KA is scientific activity ontology; 
Okd = \0 SDj O SOm } is knowledge portal domain. 

The scientific activity ontology acting as an ontology 
of the problem domain of the knowledge portal is constructed 
on the base of the ontology proposed in [6] for the description 
of research projects, and in fact it is an upper level ontology. It 
includes basic classes of concepts related to the organization 
of scientific and research activities, such as the Explorer, 
Organization, Event, Activity (project). Publication, and so 
on. This ontology also includes the Information resource, 
which serves to describe the information resources presented 
on the Internet. 

The scientific knowledge ontology fixes the basic 
content structures used to construct subject-matter ontologies. 
In particular, this ontology contains meta-concepts defining 
the structures to describe the concepts of a particular 
knowledge area as: Science Section, the Research Method, 
the Object of Research, the Subject of Research, the Scientific 
Result. 

The concepts of each subject ontology are 
realizations of the meta-concepts of scientific knowledge 
ontology and can be ordered in the "general-private" 
hierarchy. 

The concepts of basic ontologies are related to each 
other by associative relations, the choice of which was made 
not only based on the completeness presentation of the 
problem and subject domains of the portal, but also taking 
into account the ease of navigating through its content and 
information retrieval therein. 

The most important associative relations are as 

follows: 

“scientific direction” - connects events, publications, 
organizations, researches by science sections; 

• “describes” - specifies the relationship of the 
publication with the scientific result, object or research 
method; 

• “uses” - connects the research method to the 
activity, researches or science section; 

• “applies to” - relates the research method to the 
research object; 

• “result” - connects the scientific result with the 

activity; 

• “resource” -connects an information resource with 
events, publications, researches, methods and research 
objects; 

• “works in” - connects the researcher with the 
organization which he works for. 

It should be noted that, the last relation has three 
additional attributes: position, acceptance date and dismissal 
date, serving to clarify, in which position and since when the 
researcher worked in the organization. 
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The axioms included in the basic ontologies allow to 
derive additional associative relations between objects. 

As a sample of such an axiom, an axiom that derives 
a new associative relation “works _ in” is given: 

Al : if works _ in , && includes 

[o 2 ,oJ then works in {m v 0 2 ) , M x G= Researcher, 
O l , 0 2 £= Organization . 

IV. Presentation of data and information resources 

By introducing formal descriptions of the knowledge 
domain concepts in the form of object classes and relations 
between them, the portal ontology defines the structures for 
representing real objects and the relationships between them. 
In accordance with this, the data on the portal is presented as a 
set of diverse information objects and connections, which 
together constitute the information or portal content. 

Formally, each information object (IO) is an element 
of the set I, and the connection between information objects - 
the elements of the class R| (see the formula (2)). 

The content of the IO is an instance of any ontology 
class and presents the description of some objects of the 
modeled knowledge area or the information resource relevant 
to it. The semantics of the connections established between 
the IO is determined by the relationships specified between 
the corresponding classes of the ontology. 

An important component of the portal content is the 
description of information resources. As mentioned above, 
each resource presented on the portal corresponds to such an 
ontology concept as the Information resource, the set of 
attributes and the connections of which is based on the 
standard Dublin Core [7]. The description of the resource 
includes an instance concept of the Information resource and 
a set of instances of the relations connecting this concept with 
other concepts of ontology ( Organizations , Researchers, 
Publications, Events, Sections and so on). 


V. Organization of substantial access 

On the base of proposed information model, not only 
the content of the knowledge portal is constructed, but also 
substantial access to the systematized knowledge and 
information resources of the modeled knowledge area is 
organized. This function ensures advanced navigation and 
search facilities. 

A. Navigation 

While navigating through the portal, it is possible to 
select information objects of a certain class, a detailed view of 
each IO, as well as information resources, connections in 
which are contained in the viewed IO. 

Navigation through the portal begins with the 
selection of a certain class in the ontology concepts tree 
constructed on the base of relation R T . In this case, the user 
is given a list of information objects of the selected class 
L (C x ) , which is shown in the form html-page containing a 


set of links to these objects. For large lists of objects, a 
composite page is formed including a list of pages with 
navigation elements on this list. 

It should be noted that while forming a list of the 
objects of the selected class C x , a transitive closure with 
respect to R T takes place. In consequence of this resulting 
list l{c x ) will include both objects of the required class 
C x and the objects of its classes of successors i.e. 

l(c x ) = ^1% e C x uc r }, 

where C x £=C , 

C T ={c Y gc| 3R r (c x , Cy )} 

If on information objects of class C x the ratio of 
inclusion R p is given at the user's request, a set of the objects 
included in L (C x )can be represented either as a list or as a 
tree. 

Information about a particular object and its 
connections are also displayed in the form html-page. (See the 
figure), the format and content of which depend on the class 
of this object and the relations defined for it and visualization 
template. In this case, the objects associated with this object 
are presented on its page in the form of hyperlinks, by which 
one can go to their detailed description. 

Further navigation through the portal is a process of 
transition from one information object to another according to 
the relationships established between them - instances of 
associative relations R A . 

For example, while viewing information about a 
particular project (see the figure), one can see not only the 
values of its attributes, but also its connections with other 
objects. Using the presented connections as navigation 
elements, one can proceed to view detailed information about 
it as for direct relations (about the research object, about the 
research methods used in the project and the scientific results 
obtained in the course of its implementation), and on the 
reverse (about the participants of the project, describing its 
publications and information resources). 

B. Search 

The search for information objects is based on the 
ontology due to which the user is given an opportunity to set 
the query in terms of the portal knowledge area. 

Search queries are specified through a special 
graphical interface controlled by the knowledge portal 
ontology. When a user selects a class of required information 
objects, a search form is automatically generated in which you 
can set constraints on the values of the attributes of the objects 
of the selected class, as well as on the values of the attributes 
of the objects associated with this object by associative 
relationships. In this case, the conditions for the values of the 
attributes of the corresponding relations can also be specified. 

Formally, the search query constructed in terms of 
the ontology is as follows: 

Q = [i x (EC X \p(i x ) £ Rd(i x ) & Rb(i x )}, 
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where C x GC is the class of required objects. 

p(i x ) = &{a t {i x ,Vj)& & Pj {vj , Cj ) is the 

description of the required object properties i x class, where 
n is the number of user-defined attributes used, Cl t ; m is the 
number of user-defined predicates p, on the value V ■ 
attributes a, ; c ■ are constants 
Cj £ tdj , where tdj <^TD ; 

Rd(i x )= &(ri k {i x ,i Y ),ri k ^R k , i Y ^C Y \p{ri k 
is the description of the objects consisting of the objects of 
class C x in “direct” relations, where d is the number of 
user-defined “direct” relations ri k - are descriptions of the 
instance relation properties and properties of the object 
R k <ER p U R a ; p{ri k ) and Pity) consistently, the 
structure of which is similar to the description structure 

H,); 

Rb(i x )= &(ri t (iy.iy ), ri t ~R , i z fcC z p( ri 
is the description of the objects consisting of the objects of 
class C x in “inverse” relations, where b is the number of 
user defined “reverse” relations R l £=/? U/? A ; P{ri k ) 
and P{i z )- descriptions of the relation instance properties 
ri j and the properties of the object i z respectively. 

As a result of the query implementation Q , the 
search for the objects of a given class C x can be performed 
(at the user's request) taking into account the transitive closure 
with respect to the ratio R T . In this case, when processing the 
request, a set of objects will be considered 
l(c x ) = GC, UC 7 .} where 

C T =[C K ^C\3R T {C X ,C K )}. 

As a result of the search, or while passing through a 
specific connection of any IO, can be obtained by a 
sufficiently large list of objects L {C x ) (for example, a list 
of all the participants in a major project or conference). To 
manage this situation, a filtering mechanism is introduced, 
which allows to select from the list L (C x ) by specifying a 
filter. 

The content of the filter is a set of conditions 
determining the permissible values of the attributes of 
information objects and the requirements for the existence of 
connection with other IO. Formally, the filter is specified by 
an expression similar to the search query. 

This method allows, for example, to filter out the set 
of project participants both by age or scientific degree 
(conditions for the attribute), and by the research methods 
they use (conditions on the associated object). 


Conclusion 

The article considers the information model of the 
scientific knowledge portal providing the user with 
substantial access to the systematized knowledge and 
information resources of a certain knowledge field. This 
model not only provides a formal description of the system 
concepts of the problem domain of the knowledge portal, but 
also supports all its functionality. On the base of his model, 
the knowledge system and internal data storage of the portal 
are constructed, its content, navigation and search are 
organized. 

)& P(i using ontologies as a base for an information 
model, K tne knowledge portal is not just another resource 
catalog for a given topic, but a knowledge and data network 
that supports convenient navigation and substantial search. 

Separation of portal ontology into 
subject-independent and subject-matter ontologies makes the 
portal customizable to any field of scientific knowledge. 

On the base of the proposed model, together with the 
Institute of Archeology and Ethnography of the SB RAS, an 
archaeological knowledge portal [8] was developed, 
providing substantial access to a wide range of users for 
\ systematic knowledge and information resources in 
/<NiiUlVfe)t)K)gy and ethnography. Later, this approach was apphed 
in the development of a knowledge portal on computer 
linguistics [9]. 
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